智能论文笔记

More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences

Toby Johnstone , Nathan Grinsztajn , Johan Ferret , Philippe Preux

分类：机器学习 | 人工智能

2021-10-20

在加强学习算法中纳入先前知识主要是一个开放的问题。即使有关环境动态的见解，也可以在Tabula Rasa设置中使用加固学习，并且必须从头开始探索和学习所有内容。在本文中，我们考虑利用对动作序列等价的前沿的问题：即，当不同的行动序列产生相同的效果时。我们提出了一种新的本地探索策略，以最大限度地减少碰撞并最大限度地提高新的国家审视。我们表明，通过解决凸优化问题，可以几乎没有成本计算该策略。通过在DQN中取代通常的epsilon贪婪策略，我们在具有各种动态结构的若干环境中展示了其潜力。

translated by 谷歌翻译

Acme: A Research Framework for Distributed Reinforcement Learning

Matthew W. Hoffman , Bobak Shahriari , John Aslanides , Gabriel Barth-Maron , Nikola Momchev , Danila Sinopalnikov , Piotr Stańczyk , Sabela Ramos , Anton Raichuk , Damien Vincent

分类：机器学习 | 人工智能

2020-06-01

深度强化学习（RL）导致了许多最近和开创性的进步。但是，这些进步通常以培训的基础体系结构的规模增加以及用于训练它们的RL算法的复杂性提高，而均以增加规模的成本。这些增长反过来又使研究人员更难迅速原型新想法或复制已发表的RL算法。为了解决这些问题，这项工作描述了ACME，这是一个用于构建新型RL算法的框架，这些框架是专门设计的，用于启用使用简单的模块化组件构建的代理，这些组件可以在各种执行范围内使用。尽管ACME的主要目标是为算法开发提供一个框架，但第二个目标是提供重要或最先进算法的简单参考实现。这些实现既是对我们的设计决策的验证，也是对RL研究中可重复性的重要贡献。在这项工作中，我们描述了ACME内部做出的主要设计决策，并提供了有关如何使用其组件来实施各种算法的进一步详细信息。我们的实验为许多常见和最先进的算法提供了基准，并显示了如何为更大且更复杂的环境扩展这些算法。这突出了ACME的主要优点之一，即它可用于实现大型，分布式的RL算法，这些算法可以以较大的尺度运行，同时仍保持该实现的固有可读性。这项工作提出了第二篇文章的版本，恰好与模块化的增加相吻合，对离线，模仿和从演示算法学习以及作为ACME的一部分实现的各种新代理。

translated by 谷歌翻译

Sparse neural networks with skip-connections for nonlinear system identification

Erlend Torje Berg Lundby , Haakon Robinsson , Adil Rasheed , Ivar Johan Halvorsen , Jan Tommy Gravdahl

分类：机器学习

2023-01-02

Data-driven models such as neural networks are being applied more and more to safety-critical applications, such as the modeling and control of cyber-physical systems. Despite the flexibility of the approach, there are still concerns about the safety of these models in this context, as well as the need for large amounts of potentially expensive data. In particular, when long-term predictions are needed or frequent measurements are not available, the open-loop stability of the model becomes important. However, it is difficult to make such guarantees for complex black-box models such as neural networks, and prior work has shown that model stability is indeed an issue. In this work, we consider an aluminum extraction process where measurements of the internal state of the reactor are time-consuming and expensive. We model the process using neural networks and investigate the role of including skip connections in the network architecture as well as using l1 regularization to induce sparse connection weights. We demonstrate that these measures can greatly improve both the accuracy and the stability of the models for datasets of varying sizes.

translated by 谷歌翻译

All's well that FID's well? Result quality and metric scores in GAN models for lip-sychronization tasks

Carina Geldhauser , Johan Liljegren , Pontus Nordqvist

分类：计算机视觉 | (统计)机器学习

2022-12-28

We test the performance of GAN models for lip-synchronization. For this, we reimplement LipGAN in Pytorch, train it on the dataset GRID and compare it to our own variation, L1WGAN-GP, adapted to the LipGAN architecture and also trained on GRID.

translated by 谷歌翻译

Metadata-guided Consistency Learning for High Content Images

Johan Fredin Haslum , Christos Matsoukas , Karl-Johan Leuchowius , Erik Müllers , Kevin Smith

分类：计算机视觉

2022-12-22

High content imaging assays can capture rich phenotypic response data for large sets of compound treatments, aiding in the characterization and discovery of novel drugs. However, extracting representative features from high content images that can capture subtle nuances in phenotypes remains challenging. The lack of high-quality labels makes it difficult to achieve satisfactory results with supervised deep learning. Self-Supervised learning methods, which learn from automatically generated labels has shown great success on natural images, offer an attractive alternative also to microscopy images. However, we find that self-supervised learning techniques underperform on high content imaging assays. One challenge is the undesirable domain shifts present in the data known as batch effects, which may be caused by biological noise or uncontrolled experimental conditions. To this end, we introduce Cross-Domain Consistency Learning (CDCL), a novel approach that is able to learn in the presence of batch effects. CDCL enforces the learning of biological similarities while disregarding undesirable batch-specific signals, which leads to more useful and versatile representations. These features are organised according to their morphological changes and are more useful for downstream tasks - such as distinguishing treatments and mode of action.

translated by 谷歌翻译

ECG-Based Electrolyte Prediction: Evaluating Regression and Probabilistic Methods

Philipp Von Bachmann , Daniel Gedon , Fredrik K. Gustafsson , Antônio H. Ribeiro , Erik Lampa , Stefan Gustafsson , Johan Sundström , Thomas B. Schön

分类：计算机视觉 | 机器学习

2022-12-21

Objective: Imbalances of the electrolyte concentration levels in the body can lead to catastrophic consequences, but accurate and accessible measurements could improve patient outcomes. While blood tests provide accurate measurements, they are invasive and the laboratory analysis can be slow or inaccessible. In contrast, an electrocardiogram (ECG) is a widely adopted tool which is quick and simple to acquire. However, the problem of estimating continuous electrolyte concentrations directly from ECGs is not well-studied. We therefore investigate if regression methods can be used for accurate ECG-based prediction of electrolyte concentrations. Methods: We explore the use of deep neural networks (DNNs) for this task. We analyze the regression performance across four electrolytes, utilizing a novel dataset containing over 290000 ECGs. For improved understanding, we also study the full spectrum from continuous predictions to binary classification of extreme concentration levels. To enhance clinical usefulness, we finally extend to a probabilistic regression approach and evaluate different uncertainty estimates. Results: We find that the performance varies significantly between different electrolytes, which is clinically justified in the interplay of electrolytes and their manifestation in the ECG. We also compare the regression accuracy with that of traditional machine learning models, demonstrating superior performance of DNNs. Conclusion: Discretization can lead to good classification performance, but does not help solve the original problem of predicting continuous concentration levels. While probabilistic regression demonstrates potential practical usefulness, the uncertainty estimates are not particularly well-calibrated. Significance: Our study is a first step towards accurate and reliable ECG-based prediction of electrolyte concentration levels.

translated by 谷歌翻译

On The Relevance Of The Differences Between HRTF Measurement Setups For Machine Learning

Johan Pauwels , Lorenzo Picinali

分类：人工智能 | 机器学习

2022-12-08

As spatial audio is enjoying a surge in popularity, data-driven machine learning techniques that have been proven successful in other domains are increasingly used to process head-related transfer function measurements. However, these techniques require much data, whereas the existing datasets are ranging from tens to the low hundreds of datapoints. It therefore becomes attractive to combine multiple of these datasets, although they are measured under different conditions. In this paper, we first establish the common ground between a number of datasets, then we investigate potential pitfalls of mixing datasets. We perform a simple experiment to test the relevance of the remaining differences between datasets when applying machine learning techniques. Finally, we pinpoint the most relevant differences.

translated by 谷歌翻译

Automatically generating question-answer pairs for assessing basic reading comprehension in Swedish

Dmytro Kalpakchi , Johan Boye

分类：自然语言处理

2022-11-28

This paper presents an evaluation of the quality of automatically generated reading comprehension questions from Swedish text, using the Quinductor method. This method is a light-weight, data-driven but non-neural method for automatic question generation (QG). The evaluation shows that Quinductor is a viable QG method that can provide a strong baseline for neural-network-based QG methods.

translated by 谷歌翻译

Efficient brain age prediction from 3D MRI volumes using 2D projections

Johan Jönemo , Muhammad Usman Akbar , Robin Kämpe , J Paul Hamilton , Anders Eklund

分类：计算机视觉 | 机器学习

2022-11-10

Using 3D CNNs on high resolution medical volumes is very computationally demanding, especially for large datasets like the UK Biobank which aims to scan 100,000 subjects. Here we demonstrate that using 2D CNNs on a few 2D projections (representing mean and standard deviation across axial, sagittal and coronal slices) of the 3D volumes leads to reasonable test accuracy when predicting the age from brain volumes. Using our approach, one training epoch with 20,324 subjects takes 40 - 70 seconds using a single GPU, which is almost 100 times faster compared to a small 3D CNN. These results are important for researchers who do not have access to expensive GPU hardware for 3D CNNs.

translated by 谷歌翻译

Unifying Model-Based and Neural Network Feedforward: Physics-Guided Neural Networks with Linear Autoregressive Dynamics

Johan Kon , Dennis Bruijnen , Jeroen van de Wijdeven , Marcel Heertjes , Tom Oomen

分类：机器学习

2022-09-26

未知的非线性动力学通常会限制前馈控制的跟踪性能。本文的目的是开发一个可以使用通用函数近似器来补偿这些未知非线性动力学的前馈控制框架。前馈控制器被参数化为基于物理模型和神经网络的平行组合，在该组合中，两者都共享相同的线性自回旋（AR）动力学。该参数化允许通过Sanathanan-Koerner（SK）迭代进行有效的输出误差优化。在每个Sk-itteration中，神经网络的输出在基于物理模型的子空间中通过基于正交投影的正则化受到惩罚，从而使神经网络仅捕获未建模的动力学，从而产生可解释的模型。

translated by 谷歌翻译